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Summary 


Feline coronavirus (FCoV) infections are endemic amongst cats worldwide. The 
majority of infections are asymptomatic, or result only in mild enteric disease. However, 
approximately 5% of cases develop feline infectious peritonitis (FIP), a systemic disease that 
is a frequent cause of death in young cats. In this study, we report the complete coding 
genome sequences of six FCoVs; three from fecal samples from healthy cats and three from 
tissue lesion samples from cats with confirmed FIP. The six samples were obtained over a 
period of eight weeks at a single-site cat rescue and rehoming center in the UK. We found 
amino acid differences are located at 44 positions across an alignment of the six virus 
translatomes and, at 21 of these positions, the differences fully or partially discriminate 
between the genomes derived from the fecal samples and the genomes derived from tissue 
lesion samples. In this study, two amino acid differences fully discriminate the two classes of 
genomes; these are both located in the S2 domain of the virus surface glycoprotein gene. 
We also identified deletions in the 3c protein ORF of genomes from two of the FIP samples. 
Our results support previous studies that implicate S protein mutations in the pathogenesis of 


FIP. 
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Introduction 


Coronaviruses are enveloped, positive-stranded RNA viruses. They are generally 
responsible for mild enteric and respiratory infections but they can also be associated with 
severe disease in both humans and animals (Masters & Perlman, 2013). Coronaviruses are 
now recognized as emerging viruses with a propensity to cross into new host species, as has 
been shown by the recent outbreaks of severe acute respiratory syndrome and Middle East 
respiratory syndrome (Coleman & Frieman, 2014). As illustrated in Fig.1 for feline 
coronavirus (FCoV), two-thirds of the coronavirus genome encodes proteins involved in viral 
RNA synthesis. The majority of these proteins are encoded in two 5'-proximal, overlapping 
open reading frames (ORFs), ORF1a and ORF1b, and are translated as polyproteins, pp1a 
and ppíab, which are then processed by virus-encoded proteinases into 16 nonstructural 
proteins (Ziebuhr, 2005) . The remainder of the genome encodes the virus structural proteins 
(S, E, M and N) as well as accessory proteins that are not essential for replication in cell 
culture. The structural and accessory proteins are translated from a 3' co-terminal nested set 


of subgenomic mRNAs (Perlman & Netland, 2009). 


The coronavirus surface or spike (S) glycoprotein is a typical class 1 viral fusion 
protein and it has a central role in the biology of coronavirus infection. Structurally, the 
protein can be divided in to an amino-proximal half, the S1 domain, which contains the 
receptor-binding domain and a carboxyl-proximal half, the S2 domain, which contains 
elements involved in membrane fusion. These elements include heptad repeats, a fusion 
peptide and a carboxyl-terminal, hydrophobic transmembrane domain (Heald-Sargent & 
Gallagher, 2012). In many coronaviruses, the S1 and S2 domains are cleaved from each 
other by a cellular, furin-like enzyme (de Haan et al., 2004). The S protein is also the location 
of both B cell and T cell epitopes that are important in virus neutralization and the recognition 


of virus-infected cells (Reguera et al., 2012; Satoh et al., 2011). 
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FCoVs form two antigenically distinct serotypes; serotype 1, which are difficult to 
propagate in cell culture, and serotype 2, which are the consequence of a double 
recombination between type 1 FCoV' and canine coronavirus (Herrewegh et al., 1998) and 
are relatively easy to propagate in cell culture. FCoV infections are endemic amongst cats 
worldwide, and serological and molecular studies confirm that serotype 1 FCoVs 
predominate (Pedersen, 2014b). In the United Kingdom about 4096 of domestic cats have 
been infected with FCoV and in multicat households this figure increases to almost 9096 
(Addie, 2000; Addie & Jarrett, 1992). The majority of FCoV infections are asymptomatic, or 
result only in mild enteric disease. However, approximately 596 of infected cats develop feline 
infectious peritonitis (FIP), a systemic inflammatory disease that is a frequent cause of death 
in young cats (Kipar & Meli, 2014). Currently, there is no protective vaccine or effective 


treatment for FIP (Pedersen, 2009; 20142). 


The most important questions in FCoV research are why some infected animals 
remain relatively healthy, whilst others develop FIP, and what is the role of the virus in the 
development of disease. It is now widely accepted that, in the vast majority of cases, cats are 
infected by the fecal-oral route with avirulent FCoV strains circulating in the cat population. 
Initially, this virus replicates predominantly in the intestinal epithelium and is shed with the 
feces. Nonetheless, it often leads to systemic infection via monocyte-associated viremia 
(Kipar et al., 2010; Meli et a/., 2004; Porter et al., 2014). At this stage, however, the systemic 
infection is characterized by a relatively low level of virus replication and infection can be 
maintained for a prolonged period of time, possibly involving recurrent viremic events, 
without apparent disease (Kipar et a/., 2010). During replication in the intestine or, potentially, 
within monocytes/macrophages (Pedersen et al., 2012), the virus undergoes mutation and 


viruses with an enhanced tropism for monocytes and macrophages emerge. The altered 





1 In this paper, unless otherwise stated, FCoV will be used to mean serotype 1 FCoV. FCoV 
is also used as a strain designation for the species Alphacoronavirus 1 in the genus 
Alphacoronavirus, family Coronaviridae. 


95 


100 


105 


110 


115 


tropism of these mutants results in their ability to maintain effective and sustainable 
replication in monocytes (Dewerchin et al., 2005). As a direct or indirect result of a higher 
level of virus replication, this now apparently virulent virus leads to activation of monocytes 
(Regan et al., 2009), which can then interact with endothelial cells. This, in turn, mediates 
granulatomous phlebitis and periphlebitis, the morphological hallmark and initiating lesion of 


FIP (Kipar et al., 2005). 


In addition to the virus, the susceptibility of the individual infected cat to disease also 
plays a significant role and it has been shown that age, breed, gender, reproductive status 
and immune response influences the development of FIP (Pedersen, 2014b; Pedersen et al., 
2014). For example, the efficacy of early T cell responses critically determines the disease 
outcome in cats that have been experimentally infected with a virulent serotype 2 strain, 
FIPV 79-1146 (de Groot-Mijnes et a/., 2005). Furthermore, there is individual variation in the 
susceptibility of a cats monocytes to FCoV (Dewerchin et al., 2005). Also, recently, single 
nucleotide polymorphisms in the feline interferon-y gene have been linked to both resistance 
and susceptibility to the development of FIP (Hsieh & Chueh, 2014). Clearly, unraveling the 
relationship between FCoV genotypes and phenotypes, and the complex interactions 


between the virus and host during the development of FIP remains a major challenge. 


One facet of this challenge is to determine the mutations that alter the tropism and 
virulence of FCoV. As a first step, this can be done by comparing the genomic sequences of 
viruses shed in the feces of healthy animals and viruses that predominate within tissue 
lesions of cats that have been diagnosed with FIP. This approach assumes that the most 
highly abundant genome in a population is responsible for a particular disease phenotype, 
which is, however, consistent with our current understanding of FIP epidemiology. Using this 
approach, a recent paper published by Chang et al. (Chang et al., 2012) provided evidence 
for an association between FCoV virulence and amino acid substitutions within the putative 


fusion peptide of the FCoV spike (S) protein. A more detailed examination of samples from 
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FCoV infected cats that did not have histopathological evidence of FIP, led Porter et al. 
(Porter et al., 2014) to conclude that these substitutions were indicative of systemic spread, 
rather than a virus that, without further mutation, is able to cause FIP. As the S protein fusion 
peptide is involved in the fusion of viral and cellular membranes during virus entry, it seems 


plausible that changes within this region may be linked to the tropism of the virus. 


Similarly, Licitra et al. (Licitra et al., 2013) were able to distinguish between FCoVs in 
cats with and without FIP on the basis of one or more substitutions in the amino acid 
sequence that comprises the furin cleavage site within the FCoV S protein. The authors 
demonstrated that these substitutions modulated furin cleavage and suggested that a 
possible consequence of the identified substitutions is an enhanced cleavability by 


alternative, monocyte/macrophage specific proteases. 


Finally, there have been many reports over the years of point mutations and indels in 
the accessory protein genes of FCoVs and claims that these may be linked to the 
development of FIP. Prominent amongst these are reports that truncating and non-truncating 
mutations in the ORF3c gene occur in a significant proportion but not all FCoVs associated 
with FIP (Chang et al., 2010; Pedersen et al., 2012). However, the role of the FCoV 3c 
protein and any relationship to the development of FIP is still unclear. One view is that 
functional 3c protein expression is essential for replication in the gut but is dispensable for 
systemic replication. Thus, once the virus has left the gut there is no further selection 
pressure to maintain an intact 3c gene and mutations will accumulate over time. This 
interpretation does not exclude the possibility that the loss or alteration of the 3c protein may 
enhance the fitness of the virus in the monocyte/macrophage environment but this is not yet 
supported by any convincing evidence. Similarly, whilst the genes encoding the 3a, 3b, 7a 
and 7b proteins clearly have important functions that will impact on virus fitness (Haijema et 
al., 2004), there is, as yet, no evidence that links specific mutations in these genes to the 


development of FIP. 
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In this study, we report the genome sequences of six FCoVs; three from fecal samples from 
healthy cats and three from tissue lesion samples from cats with confirmed FIP. The six 
samples were obtained from cats that were resident at a single-site cat rescue and rehoming 
center in the UK. Our results support and extend previous studies that implicate S protein 


mutations in the pathogenesis of FIP. 
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Results 


FCoV RNA in fecal and tissue lesion samples 

As a first step, we amplified the FCoV RNA in fecal and tissue lesion samples. The seven 
amplicons for each of the fecal-derived RNA samples were of the expected size and were 
produced in approximately equal amounts. In comparison, there was greater heterogeneity in 
the amplicons obtained from RNA isolated from the FIP tissue lesions (Fig. 2). Specifically, 
there was more evidence of non-specific products and, especially in the case of amplicon 6, 
which encompasses the region of the genome encoding the S protein gene, there was less 
product than expected. In this context, we noted that the Ct values were generally higher (i.e. 
less viral RNA) for fecal samples than for samples from the FIP tissue lesions. The mean Ct 
values for the 65F, 67F and 80F fecal total RNA samples were 20.9, 16.9 and 29.0, 
respectively and the mean Ct values for the 26M, 27C and 280 tissue lesion samples were 
14.0, 21.5 and 15.0, respectively. One explanation for the difference in homogeneity of 
amplicons derived from fecal and lesional samples may be that the samples derived from 
lesions contain significantly greater amounts of FCoV subgenomic mRNA than the fecal 
samples, which would be expected to contain mainly virion particles. Also, 
immunohistochemistry identified a large number of macrophages with abundant viral antigen 
(i.e. N protein) within the lesions (data not shown). It is therefore very likely that the RNA 
extracted from the lesions contains much more viral mRNA than the feces. Thus, in the RT- 
PCR reactions that involve RNA from tissues, many of the oligonucleotide primers would 


bind to multiple templates, resulting in a more complex amplicon pattern. 


Assembly of genome sequences 

Using the methods described, we were able to obtain full genome coverage, with a 
minimum depth of 1000 reads at each base across the coding region (Fig. 3). We expect that 
with further optimization, it would be possible to obtain an acceptable level of coverage and 


depth for more than 4 complete genomes per single 316v2 chip. Similarly, it would also be 
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possible to obtain a very high density of reads for a single genome if, for example, the goal 
was to investigate the nature of the viral quasispecies in a particular sample. In our opinion, 
the limiting step in genome sequencing from clinical samples is the production of amplicons 
but, once this has been achieved, the downstream processing is relatively straight forward. 
Our approach was based upon the alignment of sequence reads to a de novo 
assembled target genome and this is dependent upon a relatively high similarity between 
samples. For example, in the case of the 65F, 67F, 26M and 280 samples, the percentages 
of reads that aligned to the 80F target genome were 96%, 95%, 90% and 95% respectively. 
However, only 76.896 of reads from the 27C sample aligned to the 80F target genome. Thus, 
for the 27C sample, the de novo assembly method had to be used. De novo assembly is 
more time consuming and would not be a good approach if every sample had to be analyzed 
in this manner, as would be the case if they were highly divergent. It should also be noted 
that in our analysis, we have only compared genome consensus sequences where each 
position is defined by a single nucleotide. In reality, for any sample, many nucleotide 
positions are represented by a proportion of different nucleotides. In these cases, we have 
taken the majority nucleotide as the consensus nucleotide and have not attempted to 
delineate different populations in the quasispecies. This means that when comparing 
sequences, we are only able to identify mutations throughout the population of genomes and 


do not conclude that any or all of these mutations are found in a single genomic RNA. 


Phylogenetic analysis 

Phylogenetic analysis of the six clinical samples described here, based upon the 
conserved RNA-dependent RNA polymerase (RdRp), shows that they comprise a closely 
related cluster (Fig. 4). As reported by Barker et al. (Barker et al., 2013), there is no evidence 
that the samples derived from FIP or non-FIP animals represent genetically diverse co- 
circulating strains, which provides further support for the "internal mutation" hypothesis. 


However, it is very difficult to exclude the possibility that at least some of the mutations that 
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may contribute to the development of FIP are present in a minor component of the infecting 


population, which is subsequently selected during virus replication in vivo. 


Comparison of FCoV genome sequences from clinical samples 

The genome sequences of the six FCoVs derived from fecal and tissue lesion 
samples were translated into two polyproteins (pp1a and pp1ab), four structural proteins (S, 
M, N and E) and 5 accessory proteins (3s, 3b, 3c, 7a and 7b). We found that amino acid 
differences are located at 44 positions across an alignment of the six translatomes. At 21 of 
these positions, the differences fully or partially discriminate between the genomes derived 
from fecal (i.e. non-FIP) samples and from tissue (i.e. FIP) samples. More specifically, in 
these 21 positions one, or more, of the translatomes from the FIP samples displays an amino 
acid that is not found at the corresponding position in the translatomes from any of the non- 
FIP samples (Table 1). We also identified deletions in the 3c protein ORF of genomes from 
two of the FIP samples. 

The fully discriminatory differences we identified are located at two positions where a 
different amino acid is found in all three FIP translatomes compared to all three non-FIP 
translatomes. The first of these is at nucleotide position 23302 and corresponds to the 
methionine to leucine substitution identified by Chang et al. (Chang et al., 2012). Thus, our 
data support the idea that this substitution may be critical with regard to the pathogenesis of 
FIP. The second fully discriminatory substitution we identified, which was present in all of the 
FIP samples but none of the non-FIP samples, was at nucleotide position 23486 and resulted 
in an isoleucine to threonine substitution in the heptad repeat region 1 (HR1) of the S2 
domain in the FCoV S protein. The possible significance of this substitution is discussed in 
more detail below. 

Apart from the fully discriminatory substitutions described above, Table 1 shows a 
further 19 positions where one or two of the translatomes from the FIP samples displays an 
amino acid that is not found at the corresponding position in the translatomes from non-FIP 


samples. Without any further information, it is difficult to conclude that any of these 
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substitutions, alone or in combination, may be related to the development of FIP. However, 
they should not be ignored. For example, the substitutions resulting from mutations at 
positions 22528 and 22539 both lie within the furin cleavage motif that separates the S1 
(receptor-binding) and S2 (fusion) domains of the FCoV S protein. Both substitutions (R789G 
at P4 and R792S at P1, where P4 and P1 designate positions in the canonical furin cleavage 
motif) would be predicted to alter furin cleavage activity. If this is the case, our results support 
the conclusions of Licitra et al. (Licitra et al., 2013) that identify the furin cleavage site as a 
potentially important region in the development of FIP. Alternatively, it could be argued that 
once the virus has acquired a tropism for the monocyte/macrophage, cleavage at the furin 
recognition motif may no longer be relevant to virus entry and mutations may accumulate 
due to a lack of selection pressure. For coronaviruses such as mouse hepatitis virus (MHV), 
cleavage at the canonical furin motif does not seem to be essential, at least for in vitro 
infectivity (Bos et al., 1997), and recent results suggest that activation of the coronavirus S 
protein fusion activity requires proteolytic cleavage at a different position in the S2 subunit 
(Millet & Whittaker, 2014; Wicht et al., 2014). Finally, Table 1 shows that two of the three 
translatomes derived from the FIP samples have a deletion in the 3c protein gene, which is 
not found in any of the non-FIP samples. In both cases, the deletion of 10 nucleotides leads 
to a translational frameshift that produces a 3c protein truncated eight amino acids 
downstream of the deletion site. 

In addition to amino acid substitutions that partially or fully discriminate between the 
genomes derived from non-FIP and FIP samples, our study has also identified a further 23 
amino acid substitutions that do not discriminate between non-FIP and FIP genomes. These 
are listed in Table 2. These substitutions will not be discussed in detail but it is, perhaps, 
worth noting that the majority are found either in the nsp3 protein or the amino-proximal S1 
region of the S protein. This suggests that these regions may represent the targets of 
particularly strong selective pressures. In the case of the S1 region of the S protein, we 
speculate that this selective pressure is immunological and relates to production of 


neutralizing antibodies. The selective pressures that target the nsp3 protein are unknown. 
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For completeness, we also note that we identified a single G to T mutation in the 3' UTR at 
position 28926 of the consensus sequence derived from the 26M sample that was not found 


265  inany other sample. 
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Discussion 


This study demonstrates an approach to the complete genome sequencing of FCoVs derived 
from clinical material that is achievable in a standard laboratory setting. It is based upon the 
generation of a virus-specific cDNA library using oligonucleotide primer pairs, followed by 
next generation sequencing (NGS) on a commercial platform, and downstream genome 
assembly using free software that will run on a personal computer. This approach was taken 
after we had failed to determine complete genome sequences of FCoV from clinical samples 
using a randomly primed cDNA library followed by NGS (Porter, PhD thesis, University of 
Bristol, 2014). In the study reported here, complete genome sequencing was achieved for six 
FCoVs using only seven primer pairs. However, the samples we used were all collected 
within a few months at a single location, which means that they are less likely to have 
diverged, compared to samples taken at different locations over a longer time period. As the 
number of complete genome sequences for both serotype 1 and serotype 2 FCoVs 
increases, it may be possible to design a set of universal primer pairs that will only require 
minor optimization to successfully sequence any FCoV genome. In our own laboratory, we 
have shown that the seven primer pairs described here are able to produce amplicons of the 
expected size in approximately two-thirds of geographically divergent UK fecal samples 
collected over a 2 year period (unpublished results). 

In addition to confirming earlier findings, the most interesting result of this study is 
undoubtedly the identification of a consistent substitution of isoleucine with threonine at 
amino acid position 1108 in all FCoVs from FIP lesions compared to the fecal samples from 
healthy cats. This substitution is located within the heptad HR1 region of the S2 subunit of 
the FCoV S protein and could be interesting from two points of view. First, we note that this 
amino acid position has been identified as being located in a major T helper 1 epitope (I-S2- 
6, IGNITLALGKVSNAITTTSD) in a type 1 Japanese FCoV (KU-2) that was associated with 
FIP (Satoh et al., 2011). Obviously, further research will be required to ascertain whether 


there is a Th1 epitope spanning this amino acid sequence in non-FIP associated FCoVs, and 
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to determine the quantitative or qualitative effect that may result from the isoleucine to 
threonine substitution. However, de Groot-Mijnes et al. (de Groot-Mijnes et al., 2005) have 
already drawn attention to the relationship between T cell depletion and the enhanced virus 
replication in FIP cases, although the mechanisms of T cell depletion are not yet clear. We 
suggest this is an area of FIP research that merits further study. For example, it would be 
interesting to compare IFN-y production by peripheral blood mononuclear cells taken from 
cats with FIP or healthy, FCoV-infected cats, and exposed separately to relevant HR1 
peptides, the sequences of which are derived from FIP and non-FIP associated FCoVs. 

Second, a quite different interpretation of the HR1 isoleucine to threonine substitution 
is that it may be related to the fusogenic activity of the FCoV S protein. This is because the 
substitution also lies within a stretch of 15 amino acids (NAITTI/TSDGFNTMAS) that are 
found only in alphacoronaviruses and are part of the heptad repeat structure that 
characterizes the HR1 region. Indeed, the isoleucine/threonine position constitutes a residue 
predicted to be located on the hydrophobic interface of the coiled-coil structure. Substitution 
of a hydrophobic residue with a polar, uncharged residue may, at least theoretically, 
significantly influence the intercalation of HR1 and HR2 regions, which is a necessary event 
during membrane fusion. It is also worth noting that a very recent study by Bank-Wolf et al. 
has identified a position two residues downstream of the isoleucine to threonine substitution 
where an aspartate residue was found in all examined non-FIP associated FCoVs (5 from 5) 
but was replaced by a tyrosine in a significant proportion (5 from 9) of the FIP-associated 
FCoVs (Bank-Wolf et al., 2014). Neither the isoleucine to threonine nor aspartate to tyrosine 
substitutions consistently discriminate between FIP and non-FIP FCoVs in the wider 
alignment of 29 type 1 FCoV S protein amino acid sequences that we have examined (data 
not shown) but, again, we think they may represent substitutions that are functionally related 
and could be relevant to the development of FIP. 

The comparative sequence approach taken by ourselves and others has identified a 
number of potentially interesting mutations in the coding sequences of non-FIP and FIP 


associated FCoVs. In the future, this approach can be extended, i.e. a larger collection of 
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well-defined clinical samples should be analysed, and it can be refined. For example, to 
distinguish mutations that may relate to the tropism of FCoVs from those that may relate to 
virulence, we suggest it would be important to obtain sequence data from a virus population 
that infects monocytes but is not able to replicate at a high level. Clearly obtaining 
appropriate clinical samples (e.g., blood monocytes from clinically healthy, FCoV infected 
cats) would not be easy but it would be very illuminating. The idea that a virus has to 
undergo sequential mutation in vivo in order to cause a specific disease is not unique to FIP 
(see, for example, the review on measles virus pathogenesis by de Vries et al. (de Vries et 
al., 2012)) but, we suggest, it deserves closer attention in a number of veterinary and human 
diseases. 

Nevertheless, this sequencing approach is ultimately limited. As has been stated 
before, compelling evidence that any specific mutation in the FCoV genome is important for 
the development of FIP will require the use of well-defined and characterized viruses 
produced by reverse genetics and a valid experimental model of FIP. With respect to reverse 
genetics, there are a number of robust reverse genetic systems available for coronaviruses, 
in general, and for particular strains of FCoV (namely the type 2 FCoV strain 79-1146 and 
the cell culture adapted type1 FCoV strain Black) (Thiel et al., 2014). The pressing need, 
however, is for a robust reverse genetic system that can be applied to field strains of type 1 
FCoV. In our opinion, the bottleneck is not the molecular manipulation of the FCoV genome 
but, rather, the ability to propagate type 1 FCoVs in cell culture without extensive adaptation. 
Although there has been recent progress in the development of enterocyte cell lines that 
propagate type 1 FCoVs (Desmarets et al., 2013), we believe that a more robust cell culture 
System that allows for the propagation of high virus titres and the rescue of both mutated and 
non-mutated virus will be needed. To achieve this, identification of both the cellular receptor 
and attachment factors specific to type 1 FCoVs and the transduction of well-established, 
continuous, feline cell lines that can be easily maintained will be essential. 

The second required element, a valid experimental model of FIP, is also more 


challenging than it may, at first, appear. For example, many of the commonly used animal 
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models of FIP often involve intraperitoneal inoculation. If the natural course of FCoV infection 
involves sequential replication in the gut, low level replication in blood monocytes and high 
level replication in monocytes and macrophages, and each transition is associated with the 
selection of specific mutants, then this has to be reproduced in any valid experimental model. 
In a recent report, Tekes et a/ showed that intraperitoneal infection of cats with a 
recombinant form of the FCoV 79-1146 strain robustly induced FIP (Tekes et al., 2012). 
Strikingly, the virus re-isolated from these cats demonstrated that there had been strong 
selection for a virus that reverted to encode an intact 3c protein. This is, in our view, good 
evidence that FIP results from an infection that involves initial replication in the gut. 

In summary, our results contribute to a better understanding of FCoV genomic 
mutations that may or may not be used as markers of the virus phenotype. It is also clear 
from the results that the relationship between the viral genotype and the development of FIP 
is complex. The further analysis of complete FCoV genomes in defined clinical samples, a 
robust reverse genetics system that can be applied to field strains of serotype 1 FCoV, and 
the development of valid experimental models of FIP will all be needed to throw further light 


on this relationship. 
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Methods 


Clinical samples and RNA extraction 

The samples selected for this study were fecal samples from three healthy kittens and 
post-mortem tissue lesion samples from three kittens with FIP. These samples were all 
obtained from a previously reported epizootic outbreak at a single-site UK feline rescue 
center (Barker et al, 2013). The three tissue lesion samples, designated here as 26M 
(mesentery), 27C (colonic lymph node) and 28O (omentum), were from cats F/FIP, Z/FIP 
and J/FIP in the previous study (Barker et al., 2013) and had been collected within 2 h of 
death, placed in RNAlater (Life Technologies) for 24-48 h at 4'C and then, after discarding 
the RNAlater, stored at -80'C. The diagnosis of FIP was confirmed by post mortem 
examination including histopathology and immunohistochemistry for the demonstration of 
FCoV antigen in lesions (Kipar et al., 1998). The fecal samples (65F, 67F and 80F, 


previously named #65, #67 and #80) were collected from the healthy cats within one 


month of euthanasia of the cats with FIP (Barker et a/., 2013). Samples 80F and 27C were 
from cats that were littermates and were housed within the same pen. All three cats that 
provided fecal samples remained alive and without any clinical signs that could be 
suggestive of FIP for over 1 year post sampling. Fecal samples were stored at -80'C 
immediately after collection. 

Total RNA was extracted and purified from 20 mg of tissue with a NucleoSpin RNA kit 
(Macherey-Nagel) based on the method described by Dye and colleagues (Dye et al., 2008; 
Dye & Siddell, 2007). Briefly, 20 mg of each tissue sample was disrupted in a 2 ml tube by 
adding 500 ul lysis buffer containing 1% B-mercaptoethanol (v/v) and a 5 mm stainless steel 
ball bearing. The sample was homogenized using a TissueLyser II (Qiagen) at 30 Hz for 2 
minutes and 470 ul of lysate was added to a filter column and centrifuged for 30 seconds at 
10,000 x g. A 350 ul aliquot of the filtrate was added to 250 ul of ethanol and run through a 
binding column to which DNase | was added to remove genomic DNA. Following multiple 


washes, the RNA was eluted into 50 ul nuclease-free water. The NucleoSpin RNA kit was 
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also used to extract RNA from fecal samples using a method based on that described by Dye 
et al. (Dye et al., 2008). A fecal suspension was produced by vortexing 0.5 g feces and 4.5 
ml phosphate buffered saline 5 times for 30 seconds. Subsequently, 100 ul of this 
suspension was centrifuged for 2 minutes at 10,000 x g, and the supernatant removed and 
added to 350 ul of lysis buffer containing 1% B-mercaptoethanol (v/v). The protocol 


described above (from the filter column) was then followed. 


Histology and Immunohistochemistry 

Formalin-fixed tissue samples (26M, 27C, 280) were routinely paraffin wax 
embedded and examined histologically to confirm the presence of typical FIP lesions. The 
immunohistochemistry served to demonstrate FCoV antigen within lesions, as described 


previously (Kipar et a/., 1998). 


Quantitative RT-PCR and virus-specific oligonucleotide primer design 

FCoV RNA was amplified from fecal and tissue samples using quantitative reverse 
transcriptase-polymerase chain reaction (qRT-PCR) as described previously (Dye et al., 
2008; Porter et al., 2014). Oligonucleotide primer pairs (Table 3) were designed to produce a 
total of 7 RT-PCR products (amplicons) spanning the entire coding region of the FCoV 
genome using the MacVector Primer 3 software package. Initially, the primers were designed 
based on the genome sequence of FCoV C1Je, a serotype 1 FCoV (Dye & Siddell, 2007). 
The primers were then compared to an alignment of 29 serotype 1 FCoV genome sequences 
(ClustalW, available upon request) and optimized to allow for sequence variation and 


compatibility of the primer pairs. All primers were synthesized by Eurofins MWG Operon. 


One-Step RT-PCR 
FCoV-specific primers were used to reverse transcribe and amplify the viral RNA 
contained in 2 ul of extracted total RNA using the SuperScript IIl One-Step RT-PCR System 


with Platinum Taq High Fidelity (Life Technologies) as described by the manufacturer. 
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Briefly, a 50 ul reaction was set up on ice containing 2 ul RNA, 1 ul of 10 uM forward and 1 ul 
of 10 uM reverse primer, 25 ul 2X reaction mix (as supplied by the manufacturer), 2 ul 
SuperScript Ill RT/Platinum High Fidelity enzyme mix and water to a final volume of 50 ul. 
The reaction was incubated at 50°C for 50 minutes to allow cDNA synthesis, and then raised 
to 94°C for 2 minutes, followed by 41 cycles of 94°C for 15 seconds, 50-66°C (depending on 
the primer set) for 30 seconds and 68°C for 1 min/kb of product size. The annealing 
temperature for individual reactions was determined by the melting temperature of the 
primers used. The reaction underwent a final extension phase at 68°C for 7 minutes and was 
held at 4°C. For each amplicon, 5 ul of the PCR product was separated on a 196 agarose- 
TBE gel to confirm the PCR product size and to estimate the amount of DNA by comparison 
with standards. The PCR products were then pooled in approximately equimolar amounts 
and purified using Agencourt AmPure XP beads (Agencourt AMPure XP PCR Purification, 


Beckman Coulter), following the manufacturer's protocol, and eluted in nuclease-free water. 


Next generation sequencing 

Purified, pooled amplicons were sequenced at the University of Bristol Genomics 
Facility using the lon Torrent platform (PGM with the 316v2 chip). A targeted, virus-specific 
cDNA single-end read library was produced. Briefly, DNA was fragmented using the lon 
Xpress Plus Fragment Library Kit, ligated to lon-compatible barcoded adaptors and size- 
selected for a target read length of 150-200 bases. The library was then amplified and 
purified using the lon Plus Fragment Library Kit and the Agencourt AMPure XP Kit. The 
barcoded libraries were quantified and pooled in equimolar amounts using Bioanalyzer 
quantitation. Templates were prepared from the barcoded, pooled libraries using the lon 


OneTouch 2 System. Routinely, four genomes were sequenced on a single 316v2 chip. 


Bioinformatics 
Sequence data were analyzed using bioinformatics tools including both de novo 


assembly (Trinity, http://trinityrnaseq.sourceforge.net/ and genome alignment (Bowtie2, 
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http://bowtie-bio.sourceforge.net/bowtie2/index.shtml) methods. Briefly, for samples 80F and 
27C, a de novo consensus sequence was produced from the FASTQ reads using the Trinity 
assembled components and the MacVector assembly project tool (Grabherr et a/., 2011). In 
order to identify and correct possible errors in this assembly, the same FASTQ sequence 
files were then aligned to the assembled consensus sequence using Bowtie2. The 
alignments were visualized using the Integrative Genomics Viewer (IGV) and the consensus 
sequence manually corrected on the basis of the sequence reads. Subsequently, the FASTQ 
sequence reads for four samples (65F, 67F, 26M and 280) were aligned to the corrected 
80F consensus sequence using Bowtie2. Again, IGV was used to confirm each consensus 
sequence with regard to the relevant sequence reads. All of the assembled genome 
sequences were examined and confirmed to have the expected FCoV genome architecture 
and predicted ORFs. This workflow is illustrated in Fig. 5. For selected viral genes, the 
encoded protein sequences were derived and phylogenetic reconstruction was done using a 


neighbor-joining algorithm based upon an alignment generated by ClustalW (MacVector). 
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Tables 


Table 1. Amino acid substitutions that partially or fully discriminate between the genome 
sequences derived from non-FIP (fecal) and FIP (tissue lesion) samples. The positions of the 
substitutions are indicated as the position of the relevant mutation/substitution based upon 
an alignment of the six FCoV genomes analyzed in this study (Supplementary File 1). The 


amino acid positions in the non-structural replicase proteins (nsps) refer to pp1ab. 










































































Nucleotide Protein 65F | 67F | 80F | 26M | 27C | 280 | Amino acid 
position position 
1758 nsp2 | | | V V | 549 
1794 nsp2 G G G G G R 561 
6553 nsp3 D D D G G D 2147 
14727 nsp12 Y Y Y F F Y 4872 
17883 nsp14 T T T | | T 5924 
19277 nsp16 N Nt N H H N 6389 
21370 S S S S A S S 403 
21377 S | | | | | T 405 
22291 S F F E F F L 710 
22361 S S S S | | S 733 
22528 S R R R G G R 789 
22539 S R R R R R S 792 
22757 S S S S F F S 865 
23302 S M M M L L L 1047 
23486 S | | | T T T 1108 
23589 S K K K N N K 1142 
24190 S P P P P P S 1343 
24298 S E E E Q Q E 1379 
25447 3C T T T T T M 165 
25580-25589 | 3C (-) (-) (-) (+) (+) (-) Deletion 
27228 N S S S L L S 170 
28759 7B L L L F L LS [198 
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* The consensus nucleotide constituted 8396 (26M) and 55% (27C) of the sequence reads at 
this position. 

t The consensus nucleotide constituted 60% of the sequence reads at this position. 

* The consensus nucleotide constituted 7596 of the sequence reads at this position. 

$ The consensus nucleotide constituted 85% of the sequence reads at this position 

In all other cases, the consensus nucleotide constituted more than 9696 of the sequence 
reads at a given position. 

(-) 3c protein gene was complete 

(+) 3c protein gene had a deletion. The deletions were: 26M and 27C, nt 25584-25593 (10 
nucleotides, AGGAGTTTAC). 
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Table 2. Amino acid substitutions that do not discriminate between the genome sequences 
derived from non-FIP (fecal) and FIP (tissue lesion) samples. The positions of the 
substitutions are indicated as the position of the relevant mutation/substitution based upon 
an alignment of the six FCoV genomes analyzed in this study (Supplementary file 1). The 


amino acid positions in the nsp proteins refer to pp1ab. 
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Nucleotide Protein 65F |67F | 80F | 26M |27C | 280 | Amino acid 
position position 
813 nsp2 V | | | | V 234 
1794/1795 nsp2 E G G G G R 561 
2955 nsp3 A T T T T A 948 
3082 nsp3 R Ki K K K R 990 
3797 nsp3 Q Q H Q Q Q 1228 
5218 nsp3 V A A A A V 1702 
5337 nsp3 L M M M M L 1742 
6804 nsp3 A S A A A A 2231 
8939 nsp5 K K N N N K 2942 
14564 nsp12 A A P P P A 4818 
15185 nsp13 | | L | | | 5025 
20509/20510 |S A P IL P P A 116 
20584 S D N N N N D 141 
20861 S S S5 | N S S S! | 233 
20864 S R Q R Q Q R 234 
20866 S | LT | l | L 235 
21275 S R R’ lQ R R R 371 
21467 S | T T T T | 435 
22151 S R K |K R R R 663 
22332 S | p M | | | 723 
27273 N L Q Q Q Q L 185 
27873 7A H YF |H Y Y H 6 
































* The consensus nucleotide constituted 71% of the sequence reads at this position. 
t The consensus nucleotide constituted 72% of the sequence reads at this position. 
* The consensus nucleotide constituted 71% of the sequence reads at this position. 
$ The consensus nucleotide constituted 68% of the sequence reads at this position. 
! The consensus nucleotide constituted 8596 of the sequence reads at this position. 


' The consensus nucleotide constituted 71% of the sequence reads at this position. 
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* The consensus nucleotide constituted 6096 of the sequence reads at this position. 
- The consensus nucleotide constituted 59% of the sequence reads at this position. 
™ The consensus nucleotide constituted 56% of the sequence reads at this position. 
* The consensus nucleotide constituted 78% of the sequence reads at this position. 


In all other cases, the consensus nucleotide constituted more than 96% of the sequence 


reads at a given position. 
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Table 3. Sequences of oligonucleotide primers used in this study. All oligonucleotides are 
shown as 5' to 3' sequences. The positions of the oligonucleotides are given relative to the 


genome of FCoV C1Je [GenBank: DQ848678]. 
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Name Amplicon Sequence Position in | Length 
C1Je (nt) 
F169 1 TAGGAACGGGGTTGAGAG 169-186 18 
R6507 1 GTGCGAGAACRGCCTTAA 6456-6467 | 18 
F5562 2 GTTTGAAYTCACGTGGYCATT 5511-5531 | 21 
R7490 2 GARGTCTTCATCWGAACCCAC 7441 21 
F6943 3 GCTAGTGTTAGAAATGTCTGTGTT 6932 24 
R12466 3 AAAAGCCCTACTAACGTGGTC 12421 21 
F12224 4 CATCCTGCAATTGAYGGATTG 12173 21 
R18105 4 TCCGGGTACATGTCTACGTTA 18054 21 
F17830 5 GATTGGTCCATTGTGTACCC 17782 20 
R20131b | 5 AAARCCTTCCGATGACGAGGT 20080 21 
F19786b | 6 GTATTAAGRAGATGGTTGCCA 19735 21 
R26007 6 ATAACCGCATGAGAAAAGGCT 25793 21 
F24798 7 TAAAATGGCCKTGGTATGTGT 24601 21 
R29508 7 TAGCTCTTCCATTGTTGGCTC 29167 21 
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Figure legends 


Figure 1. Genomic organization of feline coronavirus. Genomic ORFs are shown as boxes. 
Only pptab is shown as a translation product of the genomic RNA. The non-structural 
proteins nsp1—11 are translated from ORF1a (dark grey) and translation of the ORF1b 
proteins (nsp12—16) occurs following -1 ribosomal frameshifting (RFS). Nsp 11 is not 
depicted as it represents a short (nine amino acid) carboxyl extension of nsp10. NspS9, 
single-stranded RNA-binding protein (SSRBP); nsp12, RNA-dependent RNA polymerase 
(RdRp); nsp13, helicase (Hel) and NTPase; nsp14, 3'—5' exoribonuclease (ExoN) and N7- 
methyltransferase (N7-MT); nsp15, uridylate-specific endonuclease (NendoU); nsp16, 2-O- 


methyltransferase (2- OMT). 


Figure 2. Agarose gel electrophoresis of PCR amplicons 1-7 for tissue lesion samples 26M, 


27C, 280 and fecal samples 65F, 67F, and 80F. 


Figure 3. Coverage of sequence reads across the assembled FCoV genomes from fecal and 
tissue lesion samples. Sequence reads were aligned against the de novo assembled 80F 
target genome for the feces-derived samples 65F (A), 67F (B), 80F (C) and tissue lesion- 


derived samples, 26M (D), 27C (E), and 280 (F). 


Figure 4. Phylogenetic analysis of the core RNA-dependent RNA polymerase domain of 
nsp12 (amino acids 4503 to 4807 in ppíab, Supplementary file 1) for FCoV strains 
sequenced in this study and selected FCoV genome sequences. The phylogenetic tree was 
constructed by the neighbor-joining method from an alignment made with ClustalW 
(MacVector). GenBank accession numbers are shown for all sequences. Bootstrap values 


exceeded 609^ at all nodes. 
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Figure 5. Sequence assembly workflow for FCoV genomes. Fecal samples 65F, 67F, 80F 


and tissue lesion samples 26M, 27C, 280. ORFs, open reading frames. 


Supplementary File 1. A nucleotide alignment of the six type 1 FCoV consensus genome 


sequences determined in this study. The alignment was produced using ClustalW 


(MacVector). 
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